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Universal quantum simulation may provide insights into those many-body systems that cannot be 
described classically, and that cannot be efficiently simulated with current technology. The Trotter 
formula, which decomposes a desired unitary time evolution of the simulator into a stroboscopic 
sequence of repeated elementary evolutions, is a key algorithmic component which makes quantum 
simulation of dynamics tractable. The Trotter number n sets the timescale on which a computer 
running this algorithm is switched from one elementary evolution to another. In the ideal case, the 
precision of the simulation can be arbitrarily controlled by increasing n. We study a more realistic 
scenario where each gate is applied imperfectly. The resultant tradeoff in errors leads to an ultimate 
limit on the precision of the simulation. We calculate the optimum Trotter number n* that achieves 
this limit, which is the minimum statistical distance from the actual simulation to the ideal one. 


I. INTRODUCTION 

Simulation involves the control and study of a first 
physical system (the simulator) in order to provide in¬ 
sight into a second (the simuland). It is an indispens¬ 
able tool in modern science. Under a classical computing 
paradigm, a huge variety of simulands are investigated 
using silicon based digital machines, which represent and 
process classical information stored as binary data in 
memory. For a broad class of problems, the simulation 
is efficient - the simulator requires spatial and tempo¬ 
ral resources scaling polynomially rather than exponen¬ 
tially in the size of the simuland. When the simuland is 
such that a fully quantum mechanical model is necessary 
to describe it, however, the memory requirement for a 
direct and exact simulation scales exponentially in the 
number of particles. Examples include quantum chem¬ 
istry simulations [1] and simulations of condensed matter 
systems Hi]. Often a mean-field approximation or cut¬ 
off in long-range entanglement can reduce the complexity 
and make the problem tractable once more. 

The idea of quantum simulation, on the other hand, 
where the computer can store and process quantum in¬ 
formation, offers the potential to explore quantum simu¬ 
lands without making such assumptions. Quantum sim¬ 
ulators are typically thought of as being analog or dig¬ 
ital [Sj. Analog quantum simulators are purpose built, 
and have limited reprogrammability. They often process 
information stored in continuous variables, and therefore 
do not benefit from error correction. The concept is sim¬ 
ilar to the idea of an orrery, or a scale model. 

Digital or universal quantum simulators, however, are 
anticipated to operate under the quantum computa¬ 
tion paradigm [6H9|. Although they may in fact have 
more modest requirements, for our purposes they will 
be thought of as fully fledged quantum computers. The 
advantage of this type of simulation is that a single com¬ 
puter can be reprogrammed to perform any simulation, 
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as long as it has enough memory. Perhaps more impor¬ 
tantly, the error in the outcome of the calculation can be 
controlled by error-correction methods. 

Quantum computers are typically constructed with 
two-level systems known as ‘quantum bits’. Such ele¬ 
ments may be realised with trapped ions [3 |j supercon¬ 
ducting circuits HI HO], or other candidate systems m- 
The formal requirements for quantum computation |12j 
dictate that a fiducial initial state can be prepared, that 
a universal set [IMil of operations can be applied, and 
that projective measurement is possible. 

Universal quantum simulation, being an instance of 
quantum computation, is therefore composed of prepa¬ 
ration, evolution and readout stages. Analyses concern¬ 
ing state preparation and data extraction can be found 
elsewhere HSlllT]. The aim of this paper is to quantify 
the accuracy of the evolution stage when the control op¬ 
erations are faulty. To that end, in Section |TT| we discuss 
the algorithms used for quantum simulation and recap 
arguments concerning accuracy with ideal operations; in 
Section |III| we discuss a general framework for model¬ 
ing faulty operations; in Section |IV| we discuss various 
ways to quantify the accuracy of a quantum computa¬ 
tion. Our main results are found in Section |V| where 
we argue that a tradeoff between Trotter error and gate 
errors is generic to universal quantum simulation and ex¬ 
hibit this with several examples. Our results show how 
to operate a faulty simulation in the optimal way, and 
the level of precision expected at this optimum. 


II. SIMULATION ALGORITHM 

Even when a quantum computer with a universal gate 
set is available, one does not generally know how to com¬ 
bine the gates efficiently to achieve a particular desired 
evolution. 
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A. Lloyd’s algorithm 

Lloyd’s algorithm [5] is a general but approximate solu¬ 
tion to this problem, when the desired evolution is known 
to be generated by a local Hamiltonian: 

k 

h = Y.h,, ( 1 ) 

i=i 

where each of the k component Hamiltonians Hj has di¬ 
mension less than some maximum (call this g). The true 
evolution is then approximated through a truncation of 
the Trotter [TH] formula: 


U = exp[iHt] 




( 2 ) 


where we set h = 1, and take H to be time independent 
(although this can be straightforwardly relaxed). Fur¬ 
ther, we rescale our units such that time t is dimension¬ 
less. 

The advantage of Lloyd’s approach is that the number 
of operations required is bounded by a number propor¬ 
tional to where t is the simulation time, g is the 

maximum dimension of the local Hamiltonians, and e is 
the desired error j^. The total number k of component 
Hamiltonians has a better-than-polynomial (rather than 
exponential) dependence on the particle number, making 
Lloyd’s method ‘efficient’. 

It is clear that for any finite value of n (the ‘Trotter 
number’), the higher order terms in the above equation 
will be non-zero. Their neglect then leads to an error 
in the simulation - the simulator is not driven to the 
desired final state but to one that is nearby. As we shall 
see, ‘nearby’ can be given a concrete mathematical and 
operational definition. 

Higher Trotter numbers result in trajectories of the 
simulator that result in a final state that is closer to the 
ideal. Increasing n will generally increase the computa¬ 
tional complexity - more control operations. More time 
may not be required, however, because the gates are cor¬ 
respondingly shorter: although this appears not to be 
the case in error corrected implementations‘ |19j . Others 
have considered the dependence of the number of gates 
and simulation time on the desired error [19) . but here 
our only concern is the overall accuracy of the simula¬ 
tion. Higher order approximants are available [201111], or 
other techniques that exploit sparsity [22]. The common 
attributes of these approaches are that they reduce the 
number of operations to polynomial in the particle num¬ 
ber, and that their accuracy is controlled (improved) by 
increasing the number of applied operations. The number 
of Trotter steps necessary for quantum chemistry simu¬ 
lation was considered in Refs. [231 HI]- 

Throughout this paper we take Vj = exp[iHjt/n] as 
primitive operations applied to the simulator, although 


ultimately these primitives should be understood as be¬ 
ing composed from gates drawn from the particular uni¬ 
versal set that is available. 


B. Time-energy freedom 

Because of the H —>■ aH^t —>■ t/a symmetry of the 
Schrodinger equation = —iL|'0), when simulating 

closed-system dynamics one benefits from the freedom to 
define 


.^simulator ^.^siniuland 


_ 1 

^simulator — ^simuland, 

a 


(3) 


which will preserve the correspondence between the time 
evolutions of the simulator and simuland. The simula¬ 
tion time, therefore, may be chosen to be any duration as 
long as the appropriate global scaling of the energy of the 
control fields is also performed. Even classical simulators 
are rarely operated at a speed commensurate with their 
respective simulands- weather patterns of several weeks 
are simulated in a matter of hours, and supercomput¬ 
ers spend months calculating chemical reaction dynam¬ 
ics over timescales many orders of magnitude smaller. 
In fact, because each unitary exp[iHjt/n\ is decomposed 
into the natural elementary gate set of the simulator, 
one generally expects a ^ 1. Clearly this freedom en¬ 
ables the computation to be sped up (or slowed down) 
by a constant factor |25j . but more importantly we imag¬ 
ine that such freedom may prove very useful for reducing 
noise in implementations of quantum simulation, depend¬ 
ing on the particular noise which dominates - see below. 
It is worth noting that assuming this freedom is asking 
more than is necessary for universal quantum computa¬ 
tion - nevertheless in many quantum computers we ex¬ 
pect there to be at least a limited ability to perform the 
primitive operations at different physical speeds. 


III. NOISE TYPES 

The error in approximating the true evolution with the 
first term on the right-hand-side of ii) is known to decay 
at worst as oc n~^ [26j . and as Lloyd put it ‘n can always 
be picked sufficiently large to ensure the simulator tracks 
the correct time evolution to within any [desired nonzero 
accuracy]’ |5]. The perfect control of any system (quan¬ 
tum or classical) is only ever an idealisation however. 
Little is known about the real-world situation - although 
recently the Trotter decomposition has been shown to be 
stable in the sense that the overall error can be reduced 
to a fixed desired level if the precision of the individual 
steps is good enough m- Ref. [211 i® S' study of the influ¬ 
ence of noise on certain quantum simulations, calculating 
the average fidelity of the final state of the computer with 
the ideal. Here we derive analytical results for arbitrary 
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(generally non-unitary and non-commuting) component 
Hamiltonians. Further, as we shall show, our results al¬ 
low us to predict the ultimate performance of a faulty 
quantum simulator, and allow us to prescribe the opti¬ 
mum Trotter number to employ. Our results hold for a 
whole class of statistical distance metrics, including the 
most interesting worst-case metrics. 

We construct ‘faulty Trotterized quantum channels’, 
and consider the following generalised noise map, with 
Vj(p) = representing the component 

unitary processes of Trotterization (here describing the 
transformation of a d x d density matrix p describing the 
quantum state of the simulator): 

^faultyTrotter(^) = Otl 0"=1 O V,{p) 

= Or=i (• ■ • o £i2 o V 2 O £ii O Vl) [p] 

= OU£l^^%p)- (4) 

The o symbol is used here to denote the concatenation of 
quantum channels: here we have a triple concatenation 
(first due to faulty evolutions £ij following the clean ones; 
second due to applying each of the k local Hamiltonians 
in turn, and third by repeating this process n times). 
The interleaved operations £ij are unwanted evolutions; 
they carry an index i to emphasise that they may vary 
over the course of the experiment, and there may also 
be a dependence on {Hj}, t and n. In the course of our 
derivations it is useful to consider the map over a single 
Trotter iteration (faulty Trotter single step) 

It will be convenient to consider the supermatrix rep¬ 
resentation of these maps: this is defined by Tfp = £{p], 
where p is the vectorized (column-stacked) density ma¬ 
trix and Tf is a supermatrix of dimension <P. This rep¬ 
resentation makes the semigroup structure of quantum 
maps apparent. The concatenation of channels is merely 
matrix multiplication: 

n k 

r-|-ifaulty Trotter =nn T£.,Tv, 

n 

= (... Tf-jTvjTf.iTvJ 

i=l 
n 

=nTp^- (5) 

i=l 

We write the perfect implementation of the Trotter 
technique as V with unitary matrix V — { yT)" and 
supermatrix Ty = (l/Ty)" - essentially by removing all 
the faulty maps from the faulty Trotter channel. 

We write the supermatrix representation of the ideal 
map U as \J = U (£) U*, and break up this unitary evolu¬ 
tion as 

Uip)=0-Vuip) (6) 

with the notation justified by the supermatrix describing 
each ideal Trotter step U = (i/U)". 


IV. STATISTICAL DISTANCE MEASURES FOR 
QUANTUM CHANNELS 

Hauke et al. pose the question - can we trust quan¬ 
tum simulators? [29j . Rather than give a binary answer, 
one can instead ask the question ‘to what extent can 
we trust quantum simulators’ ? We aim to provide an 
answer to the above question in the form of a discrimina¬ 
tion probability. To measure the faithfulness of quantum 
operations, one can appeal to generalisations of either 
classical fidelity or classical statistical distance between 
probability distributions. We prefer the latter here: the 
trace distance between quantum states (density matri¬ 
ces) has an operational meaning because it determines 
the success probability for the state discrimination prob¬ 
lem [5D]. One is provided with either pA or pb with equal 
probability and is asked to guess which after a single mea¬ 
surement. The probability of success is 

Pdistinguish ~ ^ 11 ■ (?) 

Here 

Wpa - PbW = Tr|p^ - pb\ 

= Try ^{pA - Pb)[pa - Pb)‘^■ (8) 

The norm of the difference between the operators is a 
metric, giving the the quantum statistical distance be¬ 
tween the operators. This definition implies that the 
discrimination is informed by measurement results aris¬ 
ing from the optimum choice of POVM (Positive Opera¬ 
tor Valued Measure), or generalised measurement proce¬ 
dure [31]. In fact 

IIpa - PbII = 2maxTr(F;(pA - Pb)), (9) 

h/ 

where E is a positive operator (or POVM element). A 
similar maximisation over states in turn induces a dis¬ 
tance on quantum channels, defined as 

- ^bIIo,! = max ||(£:^ (g) I)(p) - (£b (g)I)(p)||. 

pfc O, iJsep 

( 10 ) 

The associated task is to submit an optimal initial state 
p* to undergo evolution under £a or £b (chosen at 
random), and to perform the state discrimination task 
(above) on the output state. For full generality the en¬ 
larged search space S (having dimensions d^) is needed to 
allow for the possibility of entanglement with an ancilla 
(of dimension no larger than that defined by the chan¬ 
nels themselves) assisting in the channel discrimination 
task [321 ■ When the full search space S is available, 11 • | |o 
is the diamond norm |32] : otherwise, when the maximiza¬ 
tion is over states which factorize into system, ancilla 
states, the norm is the unstabilized induced trace-norm 
II • 111 (and the ancilla plays no role). A further restric¬ 
tion considered by Lloyd [S] would be to define the set 
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of states of ‘interest’ S-mt' but this has the drawback of 
requiring detailed knowledge of the particular simulation 
at hand, and makes additional assumptions on the set of 
initial states of interest: see Figure 

These metrics then give the bias away from a half in 
the probability of discriminating between the real and 
ideal channels, given a single optimal initial state prepa¬ 
ration (for the diamond norm over a larger space), a sin¬ 
gle channel use and a single sample from an optimally 
chosen measurement basis (again for the diamond norm 
on a larger space): 

Pdistinguish ^ 2 (H) 

They are therefore worst-case metrics. Yet another norm 
to consider is the J-norm || • ||j: this provides a bound 
on the average trace distance over a uniformly measured 
state space [33]. The J-distance between two quantum 
maps is merely the trace distance of the associated states 
in the Jamiolkowski isomorphism |34j : 

\\£A-£B\\j = \\Ji£A)-Ji£B)\\, (12) 

where J{£) oc £{\i){j\) ® |*)(j| with the constant of 
proportionality such that J{£) has unit trace. The J- 
distance is related to the average probability of discrim¬ 
inating the real and ideal simulations, given an optimal 
measurement: 

Pdistinguish — W^A 11 j ■ (13) 

For a comprehensive survey of available metrics, see 
Ref. |33| |3S]. 

There are a number of senses in which these metrics 
are pessimistic measures. Firstly, for the worst-case met¬ 
rics, an optimal input state (possibly entangled with an 
ancilla) may not arise in a quantum simulator: readout 
methods such as the phase estimation algorithm [33] , for 
example, do not employ such states. Currently, the full 
set of preparations and measurements at the disposal of 
a quantum computer are not made use of in quantum 
simulation algorithms: in the future, more involved pro¬ 
cedures (including readout using an ancilla) may turn 
out to be more powerful at extracting information. Sec¬ 
ondly, the maximisation over POVMs: in reality, the 
measurements are likely to be restricted to those observ¬ 
ables necessary for calculating macroscopic quantities, 
e.g. the ground state energy of the system, the Shannon 
entropy, effective temperature, net magnetisation and so 
on. Consider that these measurement choices might be 
completely impervious to certain types of error, but yet 
the errors are ruthlessly sought out and exposed nnder 
the trace-norm or diamond-norm distances. In other 
words, there may be an equivalence class of (possibly mu¬ 
tually orthogonal) microstates with the same macrostate 
(or property of interest). An ‘incorrect’ microstate could 
well be good enough for the purposes of the simulation 
(even yielding precisely the correct ‘answer’) whilst being 


input state , , output state 

channel .... 

preparation discrimination 



FIG. 1. (Color online) The distinguishability of two quantum 
channels £a and £b can be defined as the distinguishability 
of the outputs of each channel when the input is an optimally 
chosen quantum state p*. The optimum is generally drawn 
from a set S describing joint states of an enlarged system. The 
subset Ssep, contains states with no system-ancilla entangle¬ 
ment and a further subset Sint, which contains only physical 
states of interest. 


highly (or even perfectly) distinguishable from the cor¬ 
rect state. Consider for example a very large simulator 
suffering only a bit flip on a qubit corresponding to the 
least significant digit of the readout register (minimal ef¬ 
fect on readout result), or even a simulator which suffers 
a severe scrambling of phase relations prior to measure¬ 
ment in the computational basis (potentially zero effect 
on readout result). The game of simulation is not neces¬ 
sarily an adversarial one - nevertheless due to the often 
random and potentially signihcant effect of errors, worst- 
case metrics are usually regarded as the most relevant 
measures to employ. 

All metrics satisfy some important properties which we 
will make use of: the triangle inequality ||£^ —fell < 
\\£a-£b\\ + \\£b-£c\\ and convexity \\iZiPi£i\\ < 
Another important property satisfied 
by all metrics considered in this paper is chaining: 

I \£At ° £A'2 - ^Bi ° £b2 11 < I I^Ti - £bi 11 + \ \£a2 - £b2 11 ■ 

The diamond norm and J distance are special in that 
they are stable: they satisfy ||I(8)£||^j = ||£||^ j. See 
Gilchrist et al. |33| for further details. 

The convexity property is an important one, and is 
enough to ensure that the maximum statistical distance 
is achieved on a pure state (33). Further, although often 
understood as the inability to increase distinguishability 
through averaging (i.e. as a handicap in information pro¬ 
cessing tasks such as parameter estimation IS]), here it 
implies that the performance of a faulty quantum com¬ 
putation may be improved by repeating the computation 
and averaging the results. As we shall see below, this 
can lead to quite significant improvements, because the 
fluctuations may be suppressed. 

A related fact concerns appropriate worst-case bench¬ 
marks. Whilst the ideal evolution U is unitary and will 
take a pure state to a pure state, a completely noisy 
channel =l/d destroys the purity of any state 

(having dimension d). One can prove [33 that under the 
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unstabilised trace norm one has 


gP^Vd _ ^ 



(14) 


whereas with the diamond norm |38j and J-distance 
(which we concentrate on for the remainder of this ar¬ 
ticle) one has 


gP^l/d _ 


o,J 


= 2 - 


2 


(15) 


These values represent benchmarks - the distinguisha- 
bility between the output of the ideal simulation and 
complete noise. Any simulation giving a higher distance 
that this will be worse than a completely random out¬ 
put! Note that both norms converge to the algebraic 
maximum of 2 as d —> oo. The algebraic maximum is 
achieved, for example, by an unwanted bit flip. 


V. RESULTS 


We are now in a position to calculate the accuracy of 
a realistic quantum simulator. To that end, we apply the 
norms introduced above to calculate the statistical dis¬ 
tance from the ideal map to the faulty Trotter map. To 
see why we expect an optimum Trotter number to emerge 
in realistic universal quantum simulators, consider hrst a 
simple application of the triangle inequality to the dis¬ 
tance between ideal and realistic channels over a single 
Trotter step: 




< 




-h 


■s/V - 


(16) 


Next, assume the first term is bounded by a quantity oc 
for some s > 2: this captures the first order Trotter 
formula that we study here (s = 2), as well as higher 
order expansions [5D]. Now assume the second term is a 
constant with respect to n. This immediately leads, via 
the chaining property, to 

D:=\\U- £f“ltyTrotter I I < 

M M - „s-l V J 

for some constants and It is simple to show that 

the optimum Trotter number is 


n = 






(18) 


and the statistical distance at this optimum is 


Din) 


s — 1 V ^ 


(19) 


Depending on the values of and Si (which also depend 
on s in the general case), one is able to also optimise over 
s to find the best order Trotter formula to use. For the 


remainder of this paper, however we set s = 2 and these 
quantities reduce to 


and 



( 20 ) 


D(n*) = 2V¥S. (21) 


As we shall show, is hxed by the pair while S 

is determined by gate errors. Below we investigate some 
example noise models that influence S. 


A. Mistimed control 


Trotterization fundamentally requires ‘switching’ be¬ 
tween unitary gates: in our simplification this is thought 
of as switching each component Hamiltonian Hj on for 
a specific duration. Consider that the duration is in¬ 
creased (decreased) by a random number Aij, normally 
distributed around zero with variance This makes 




( 22 ) 


Such imperfection is ubiquitous in the control of quantum 
systems, and an equivalent imperfection (laser intensity 
fluctuations) was cited as the dominant error source by 
Lanyon et al. in a recent implementation of universal 
quantum simulation [5]. Under this noise model, the to¬ 
tal map ffauityTrotter^ being a concatenation of random 
unitaries, is itself a random unitary map. Note that the 
magnitude of the errant time-shift A^- is independent of 
t or n, but the error map is more severe when ||i?i|| is 
large, i.e. the simulator is driven ‘hard’. 

As shown in Appendix by assuming 

^ 1 0116 oan find the difference 

between real and ideal supermatrices for this error 
model, finding 


i/U 


rpfTsS 

j<i 


Ol) 


2n2 


+ i ^(I ® H* - Hj ® I)Ay 


-f ^(I (g) H*H^ + HjHi ® I) 

]<i 


—Aij -\ — Ail + Aij Ail 

n n 




2 Aij + Aij 


3 1 


— ^ij H—Aj/ + A-ijAii 

n n 


( 23 ) 
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FIG. 2. (Color online) Diamond norm distance versus Trot¬ 
ter number for a quantum simulation of the Hamiltonian 
H = ax + (Jy over a duration t = 2. Each transparent marker 
represents a single Monte-Carlo simulated random unitary 
evolution of the simulator. The solid lines represent the per¬ 
formance of the averaged map and the dashed lines are the 
average of the individual Monte-Carlo runs - lower is better. 
Note how the position of the averaged map is below the aver¬ 
age position of the markers. The performance of the average 
map is better than the average performance of the maps of 
which it is composed. The two colours correspond to different 
noise characteristics of the mistimed control noise model. 


Note that the terms linear in A work to introduce ‘coher¬ 
ent’ or reversible errors that may be averaged away be¬ 
cause of our assumptions A^- = 0 and A^fcAim = 
hence 


By the chaining property we reach 

t‘2 

^MTC ^( 26 ) 

with 


^ = 


^ (I 0 [Hy,Hi]* + [H„Hi] O I) 

3<l 





+ 2Hj ® H*'^ 


(27) 


Comparing (26) with (17), we see that j2 = ^ and 

Note that we leave the choice of norm free 
here: ? S {o, 1, J,...}. We made use of only the triangle 
inequality and the chaining property. As we will show, 
if .c/ : happens to be invariant under choice of norm 

the optimum Trotter number is also an invariant, since 
it only depends on this ratio. Note also that these are 
norms of supermatrices: the supermatrices may need to 
be converted to another form in order that the norms are 
evaluated. 

Applying the triangle inequality to and SS we note 
that the first quantity is upper bounded by the sum of 
at worst ^(fc^ — fc) norms, and the second by the sum 
of only k norms. Since k is polynomial in the particle 
number, the Trotter error (sometimes called the digital 
error) therefore grows only polynomially in the size of the 
simuland. 

Taking n S K for the moment, simple analysis yields 
i) the optimum Trotter number 


”Mtc = \ (28) 


^ - Tf- = ^(I 0 01) — 


J<i 


+ i ^(I 0 Hf + H] 0 1)^2 


- ^ 0 + ... 


(24) 


ii) the statistical distance at this optimum 

-Dmtc(^*) = (29) 

and iii) the maximum simulation time after which the sta¬ 
tistical distance is above the accepted level Umax, given 
by 

tmax — C^max /(crV2i/^). (30) 


leaving the ‘incoherent’ or irreversible errors (quadratic 
in A) remaining. Clearly averaging can help reduce fluc¬ 
tuations introduced by the noise. This is expected, be¬ 
cause of the convexity property of the norm. The re¬ 
sultant (less severe) map is denoted MTC, ?for averaged 
mistimed control?. Figure shows (through numerical 
simulations) how averaging can reduce the simulation er¬ 
ror for this noise model, as measured by the diamond 
norm distance. Taking the norm of (24) gives the sta¬ 


tistical distance, and employing the triangle inequality 
gives 


The meaning of equations (|28[),(29| and (30) is inherited 


from the choice of norm in the definition of and 
Of course in reality n* G Z: the true optimum must 
be a whole number of Trotter steps, and so n* should 
be rounded in the direction of the sign of [n*J [n*] — 
l{2^a'^). See Appendix [c| for more details. 
Although our formulae apply generally, as a case study 
we study a special case of the Ising Hamiltonian 


N 




H2 = Y,« 

{r,s) 


(31) 


_ 

MTC 




2n2 




( 25 ) 


with and being the Pauli matrices acting on qubit r 
of N, with identity matrices implied on other qubits. The 
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notation s) denotes a sum over nearest neighbours. 
Note that [Hi,H 2 ] ^ 0. As shown in Figure [^, our 
approximate analytical upper bound (261 is a good fit 


for exact numerics in the correct parameter regime (for 
example we chose f = 0.1 and N = 2). 

We note three regimes where the fit is worse: i) when 
||i7it/n|| « 1 (the perturbative expansion breaks down), 
ii) in the tradeoff region near the optimum Trotter num¬ 
ber n* (here the chaining inequality is loose) and iii) when 
the distance measure approaches its maximum (the errors 
are saturating, and the chaining inequality between con¬ 
secutive Trotter steps is loose). Despite the looseness of 
the fit in these areas, the location of n* and D(n*) are 
well captured. 

Aside from combatting the gate noise via averaging, 
one can apply the time-energy freedom noting that 


o ^ 1 D- 


MTC 


< 


- -h SSniaa)^. 

2n 


(32) 


Clearly one can now take a < 1, i.e. retard the simula¬ 
tor to decrease the absolute error at each Trotter step. 
This has the same effect as reducing a- the first term is 
unaffected. 


B. Trotter step induced depolarisation 


Next we study a different noise model, which we 
call the Trotter-step-induced depolarization, or DEPOT 
model. Depolarising noise is commonly used to phe¬ 
nomenologically model noise because it makes analytic 
results tractable. To gain an idea of how each operation 
might introduce generic noise into the simulator, we as¬ 
sume that after a single Trotter step the combined effect 
of imperfections in the applied gates is to depolarise the 
entire simulator uniformly. The faulty map is then 


^faultyTrotter(^) = O (. . . V 2 O Vi)(p) (33) 

= (34) 


with 


£F^^°^{p) = {l-p)p + ^I. (35) 

Here I is the d x d identity matrix. Note there is no de¬ 
pendence on any variable other than p, the probability of 
a depolarising error during one Trotter step. This quan¬ 
tity therefore captures the severity of the faulty control. 
Now 


-•fTss 


- Vw=(i-p) W 


+psp^y<i- ^ 


={i-p) 


i/v- ^ 


+p \£p^y‘‘- , 


(36) 


and we can use convexity to reach 


ffTss _ ^ 


<(1 -P) 




+ P 


^p^i/d _ ^ 


(37) 


Now the first term follows trivially from the above calcu¬ 
lation and equals (1 — p) . The second term is 

cr=0 

also known (see above). This means that (again employ¬ 
ing the chaining inequality over the n iterations of the 
single step channel, and assuming \\Hi\\t/n 1) 


/ 2 

Ddepol ^ (1 — P)'^^ 


which we can simplify, in the case of d 3> 1 to 


(38) 


Ddepol « (1 — p)'^^ X—f 2pn. (39) 

2n 

Qualitatively this is very similar to the MTC model. One 
cannot however leverage the freedom in the time-energy 
correspondence to effectively reduce p, the characteristic 
of the noise, because (391 is invariant under that trans¬ 
formation. One has 


^DEPOL — 

or in the case of d ^ 1 

^DEPOL — 


I (1 — p)^ 

2p(2 - ^) 


(1 — p)s/ 

Ap 


(40) 


(41) 


Note that, as with above, one should round n* to the 
nearest integer in the direction of the sign of [n*J |"n*] — 
{1 — p)j^t^/Ap. One has 


Ddepol(»^*) = 2t^/sxfp{l — p). (42) 

See Figure for comparison with numerical results, 
which show that the DEPOT noise model has qualita¬ 
tively similar behaviour to the MTC noise model and 
also that our approximate analytical approach continues 
to provide a good fit for the true performance. 


C. Decoherence 

To model the overall noise of a simulator under perfect 
control, we depolarise the simulator in a manner that is 
independent of the number of operations. This means the 
number of Trotter steps does not affect the total noise 
characteristics. A very similar argument to the above 
gives 

^faultyTrotter _ ^ _ p(t))[V - U] 

+ p{t)[£P^^/'^ -U] (43) 
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FIG. 3. (Color online) Log-Log plot of the statistical distance versus the Trotter number for a faulty simulation of the Ising 
Hamiltonian ( Eq. with N = 2 and t = 0.1. The noise model is either a) timing errors (the MTC noise model) or b) 

depolarising noise (the DEPOL model). The horizontal axis represents the frequency of applied operations, and the vertical 
axis is related to the probability that the actual simulation could be distinguished from the ideal simulation (lower is better). 
Solid colored lines (numerically calculated) correspond to the diamond norm distance, and the dotted lines are approximate 
analytical formulae. The different colours correspond to either a) the standard deviation a of the noise or b) the probability 
p of a depolarising error at each Trotter step, each shown in the legend. Black lines are the J-distance (solid lines are exact 
numerics, dotted lines are approximate analytics). The crosshairs show the analytically predicted optimum Trotter number and 
performance level at that optimum. Note the n —>■ oo behaviour shows the norms saturating to the completely noisy benchmark 
(see main text). 


and nowp(t) increases over time. Typically l—p{t) repre¬ 
sents exponential decay of population and coherence with 
time |31j . The first term follows trivially and is equal to 
(1 -P) ^mtcL=o- We have 


T’decoh < (1 - pit))-;: -1- Pit) (2-35- 


2n 




(44) 


Once more employing the time energy freedom (§ we 
find 


.Ddecoh < 1 — P 


2n 


2 - 


(P 


(45) 


note that accelerating the simulator a > 1 will improve 
performance if pit) is monotonically increasing. Note 
here there is no optimum Trotter number, because the 
noise is not worsened by increasing the number of oper¬ 
ations. 


VI. DISCUSSION 

By modelling some simple imperfections in the control 
of a Trotterized universal quantum simulator, we have 
shown how the accuracy of the quantum channel that is 
applied depends on various parameters. These parame¬ 
ters include those outside of direct experimental control, 


such as the severity of environmental decoherence or the 
spread in control timing; and also controllable quantities 
such as the Trotter number and the simulator/simuland 
time ratio. We suggested ways in which these quantities 
can be optimised over in order to improve the accuracy 
of the simulation, which we quantified by calculating the 
statistical distance to the ideal map. The optimisation 
implies only that the uncontrollable quantities be esti¬ 
mated from control experiments, and that (simply re¬ 
lated to s/ which measures the total non-commutativity 
of the local pieces of the Hamiltonian) and & (which 
measures the control error per Trotter step) be calcu¬ 
lated only once (possibly numerically). We discussed the 
choice of a number of norms for this purpose, including 
the worst-case induced-trace norm and stabilised (dia¬ 
mond) norm, as well as the average-case J-distance. 

In particular we found there to be a finite optimum 
Trotter number to employ, setting a maximum perfor¬ 
mance and a maximum simulation time. By way of a 
general argument, we have shown these features to be 
generic to faulty simulators operated with a Trotterized 
(or similar) algorithm. In order to predict these quanti¬ 
ties, one can appeal to a microscopic model of the sim¬ 
ulator - having a sufficiently good model for a physical 
system is arguably a prerequisite for using it as a com¬ 
putational device [5^ . 

The problem of noise in quantum computers has, in one 
sense, been solved by the error-correction threshold the- 
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orems [3DH13]- An error correction threshold is a critical 
value of a measure of the accuracy of quantum control. 
Once it has been surpassed, error correction techniques 
work to decrease the overall net error. These theorems 
show that, when experimental operations become clean 
enough, encoding logical qubits in a larger number of 
redundant physical qubits allows the error in the overall 
computation to be suppressed at will |31] . But as interest 
in simulators grows |45j . and small scale (say 64 qubit) 
devices begin to appear, the noise problem remains until 
many-thousand-qubit devices with sub-threshold phys¬ 
ical error rates can be engineered. Even after this is 
achieved, imperfections will not have completely disap¬ 
peared: the question of how the accuracy of overall com¬ 
putation depends on the necessarily finite residual (i.e. 
error-corrected) error rate |46j remains of high impor¬ 
tance, and our analyses here will still apply. 

We will briefly comment on two recent experimental 
demonstrations in superconducting systems. The Mar¬ 
tinis group perform an investigation of fermionic models 
with four transmon qubits [5] : but they fail to control the 
error in the simulation at all, operating in the gate-error- 
dominated regime n > n*. The Wallraff group, investi¬ 
gating interacting spin systems using only two transmon 
qubits, successfully sweep the Trotter number through 
the optimum point [47) . for various times t. By increas¬ 
ing the total simulation time they effectively increase the 
Trotter error until it competes with the other errors. Our 
results predict that, if both the Hamiltonian and error 
per Trotter step (i.e. and are kept constant then 
n*/t should be a constant: the experimental results are in 
agreement with this prediction. The worst trace distance 
in these experiments was of order 10“^. 

These experiments show the difficulty in achieving 
good confidence in quantum simulators, especially as the 
number of qubits is increased. Using our results (20) 
and we estimate that for the Wallraff experiment 
(fixing the Hamiltonian and simulation time) reaching 
the sorts of precision we are accustomed to with modern 
classical computers (the current standard ‘single preci¬ 
sion’ is 2“^^ « 10“^ [H]) will require a factor of at least 
10® more Trotter steps and therefore a 10^^ improvement 
in the error per Trotter step. This is due to the fact that 


the overall error is proportional to the square root of the 
error per Trotter step. 

Furthermore as pointed out in Ref. [13]) a full simula¬ 
tion using readout with the phase estimation algorithm 
calls for simulation not only of U but of further in¬ 
creasing the required number of Trotter steps and there¬ 
fore precision per step. As Wecker et al. show [33], the 
number of logical gates per Trotter step (for example in 
a quantum chemistry simulation) is roughy 0(iV®). This 
fact means that the error per gate may need to be several 
orders of magnitude smaller still. 

That simulations are only likely to provide novel in¬ 
sight when featuring many dozens of qubits implies that 
yet another improvement in gate quality is necessary. 
Consider an A^ = 64 qubit simulation with nearest neigh¬ 
bour interactions. Compared to the recent N = 2 exper¬ 
iment |47|, there will be a thousandfold increase in the 
number of commutators between the Hi] many of these 
may be expected to vanish but nevertheless we might 
expect £/ (and therefore ‘^) to increase by two or even 
three orders of magnitude, requiring an improvement in 
^ of comparable magnitude to compensate. 

Future work will investigate the multiparameter opti¬ 
misation when all noise types are present, the use of alter¬ 
native Trotter-type approximants, and the performance 
of simulators when the set of interesting states and ap¬ 
plied measurements is chosen from a restricted set. 
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Appendix A: Analytics 

We are interested in the difference between the faulty 
Trotter map described by the MTC noise model and the 
ideal map. It is convenient here to choose the super¬ 
matrix representation. For a single shot map over a single 
Trotter step 
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2n^ 
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(A2) 


which, after employing the identity 


--iE+E+E + E+^E = E-E h,hisY,w,.h,], 

j>l j<l j^l } I j^l j<l } I j<l j>l ) j<l 


gives 


i/U - = ^(I 0 [Hj,HiY + [Hj^Hi] ^l)—+t ;^(I 0 H* - Hj ® I)A, 

3<l 3 

+ ^(I (g) H*Ht + HjHi g) I) 

3<l 


—Ay H —Ail + AijAii 
n n 


2 —A ■ ■ + A^ 
n ^ 


-Y,H3®Ht 

3^ 


—Ay H —Ail + AijAii 

n n 


which is the result in the main text. 


(A3) 


Appendix B: Numerics 

The diamond norm distance between two unitary maps 
described by matrices U and V is simply the diame¬ 
ter of the smallest enclosing circle of the eigenvalues of 
UV^ inni- In our numerical work we used python code 
from Minase m- 


To calculate the diamond norm for a general map ex¬ 
pressed in supermatrix, Choi or Jamiolkowski form, we 
used a semidefinite programming |521153j algorithm due 
to Watrous [54l[55], implemented by Johnston in a mat- 
lab package called qetlab [56]. 
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1. Total noise and depolarisation 


2. Averaged Mistimed Control 


What is the supermatrix form of = I/d? 

One can exploit the trace of the density matrix to get 
the following supermatrix: 


The quantum channel under consideration is what we 
call the averaged mistimed control (MTC) model. It is 
the average map that is applied when the timing suffers 
a nonzero spread. 


rj^p — >-I/d _ 





.. 0 

1 

d 

0 .. 

.. 0 

0 

0 .. 

.. 0 

0 

0 .. 

.. 0 

1 

d 

0 .. 

.. 0 

0 

0 .. 


—faulty Trotter 


= nTf" = 1111^50 (B2) 


* J 


here we have exploited the independence of the random 
(Bl) (matrix) variables to distribute the expectation over the 
concatenation of channels. Now for the MTC model the 
interleaved maps can be written 




(B3) 


each ellipsis denotes that there are d zero entries between 
the nonzero elements. Clearly each off-diagonal is anni¬ 
hilated and each diagonal goes to (Trp)/d. From here 
we can easily construct a supermatrix representation of 
fDEPOL ^ Jqj. example by scaling by p ^nd adding 

(1 -P)ld2xd2- 


with the probability distribution p{Aij) taken as a prod¬ 
uct of independent Gaussians with zero mean and stan¬ 
dard deviation of a. Now, expanding the density matrix 
p in the energy eigenbasis, each operation merely gener¬ 
ates a phase and we can evaluate the integral (here we 
suppress the Hamiltonian index j for clarity): 


P^ ^ e i'^pjnn\Em){En\ 6 "^^'^dAij 


1 




^ Prnnl^m) {^n\ j 






Pmn \E^){En 


(B4) 

(B5) 

(B6) 


We defined pmn = {Em\ P \En) with H \Em) = E^ \Em)- 
Observe how the map acts on energy eigenstates: as 
an element-wise product p ^ A ■ p with a matrix hav¬ 
ing entries Amn = e“ 2 (Em-E„) a ^ If we define W 
as the unitary matrix changing from the canonical into 
the energy eigenbasis, then the map acts on any p as 
p^W\A-WpW^)W. 

The supermatrix Ta corresponding to the elementwise 
product can be found by reshaping A into a vector and 
then constructing a diagonal matrix from this vector. 
The advantages of using a supermatrix is the composition 
of channels is simply matrix multiplication. The map can 
be defined for any density matrix in the following way 

= TwtTATwp (B7) 

and Tw = IT G W* and so on. 

By using these techniques, one is able to build up the 
total (averaged) faulty Trotter map in supermatrix form, 
which is convenient for numerical calculations. 


Appendix C: Finding n* as an Integer 

Using calculus, i.e. assuming n to be a real number, 
will generally give a non integer solution for n*. We can 
always round this up (or down) to get an integer, and in 
the worst case we may round in the wrong direction. Here 
we briefly show how to make sure the integer chosen is 
the correct one. To ensure the lowest value for D, clearly 
we should choose the ‘rounded down’ value [n*J or the 
‘rounded up’ value [n*] when the quantity 

Di\nn)-D{[n*\) (Cl) 

is positive and negative respectively. When it is exactly 
zero, we can choose either. Let [n*J = k and let 

D(k) = '^ + 9k. (C2) 

k 

and our condition simplifies to 

fc(fc + l)-|. (C3) 
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This justifies the claim in the main text that one should 
round n in the direction of the sign of |"n*] [n*J — {'rf /^). 
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